Description

[0001] The present invention relates to a novel protease for cleaving the carboxyl side of a glutamic acid residue in
an amino acid sequence of a polypeptide, and a DNA sequence encoding the protease.

Background of the invention 

[0002] As an enzyme that acts on a protein (polypeptide) so as to specifically cleave the carboxyl side of a glutamic
acid (Glu) residue, V8 protease derived from Staphylococcus aureus V8 strain is known (Gabriel R. Drapeau et al., J.
Biol. Chem. 247, 20, 6720-6728 (1972)). This enzyme is classified as a serine protease. C. Carmona et al. (Nucl. Acids
Res., 15, 6757 (1987)) cloned a DNA sequence encoding this enzyme.

[0003] A similar enzyme, an acidic amino acid specific endopeptidase derived from Streptomyces griseus, which is an
actinomycete, is also known (Norio Yoshida et al., J. Biochem. 104, 3, 451-456 (1988)). Recently, I. Svendsen et al.
reported an amino acid sequence of the acidic amino acid specific endopeptidase purified from Streptomyces griseus
(FEBS LETTERS 292, 165-167 (1991)). Furthermore, a glutamic acid specific endopeptidase derived from Bacillus
subtilis is also known (Takuro Niidome et al., J. Biochem. 108, 965-970 (1990)). With respect to some of these proteases,
the characteristics thereof have been studied and inhibitors against them are known (K. Nagata et al., J. Biochem. 110,
859-862 (1991) and T. Komiyama, J. Biol. Chem. 266, 17, 10727-10730 (1991)).

[0004] The aforementioned enzymes are useful for specifically cleaving a protein at the aforementioned site, either for
the purpose of structural analysis of the protein or for obtaining an objective protein through cleavage of a fusion
protein in the case where the objective protein is produced as a fusion protein by a genetic recombination technique
and the like. In the latter case, for example, after producing the objective protein coupled with another protein
through a Glu residue, the resultant is subjected to cleavage with one of the enzymes.

[0005] Thus, the objective protein can be separated. Accordingly, there is a demand for a protease having such an 
enzymatic activity other than the above-mentioned enzymes. 

Disclosure of the Invention: 


[0006] The objective of this invention is to provide a novel polypeptide having protease activity for cleaving the
carboxyl side of a Glu residue, and a DNA sequence encoding the protease.

[0007] The present inventors have made various studies to obtain a polypeptide having the activity of an acidic amino
acid specific endopeptidase (in particular, a glutamic acid specific endopeptidase) from a strain of a microorganism
other than the above-mentioned microorganisms. As a result, they found a novel polypeptide having the above-mentioned
characteristic derived from Streptomyces fradiae, and studied the characteristics thereof in detail. Furthermore, they
determined a DNA sequence encoding this polypeptide, thereby accomplishing the present invention.

[0008] The polypeptide of this invention comprises a mature peptide and a signal peptide, wherein said mature
peptide is characterized by having protease activity for cleaving a peptide bond including a carboxyl group of a glutamic
acid residue in a peptide and having an amino acid sequence from Val in the 1 position to Tyr in the 187 position of
SEQ ID No. 1, and wherein said signal peptide comprises an amino acid sequence from Met in the -170 position to Tyr
in the 187 position of SEQ ID No. 1.

[0009] In a preferred embodiment, the protease is derived from Streptomyces fradiae. 
[0010] In a preferred embodiment, the protease is derived from Streptomyces fradiae ATCC 14544 strain. The DNA
sequence of this invention encodes any one of the aforementioned polypeptides.

[0011] In a preferred embodiment, the DNA sequence comprises a base sequence from A in the 435 position to C
in the 1505 position of SEQ ID No. 1.

[0012] The polypeptide of the invention (hereinafter referred to as the SFase) is produced from Streptomyces fradiae.
For example, it is produced from Streptomyces fradiae ATCC 14544 strain available from the American Type Culture
Collection (ATCC).

Brief Description of the Drawings:

[0013] Figure 1 shows a comparison of the amino acid sequences of a mature peptide of the SFase of the
invention (upper lines) and a mature peptide of the same kind of enzyme derived from Streptomyces griseus (lower
lines). The portions shown with dots indicate identical amino acids.
[0014] Figure 2 shows an elution pattern of the SFase from S-Sepharose.

Preferred Embodiments for Practicing the Invention:

[0015] The present invention will now be described in detail in the order of the procedures.

I. Culture condition:

[0016] The SFase of the invention is produced by using Streptomyces fradiae, for example, Streptomyces fradiae
ATCC 14544 strain. In culturing a strain that can produce the SFase, no special medium is required, and any
ordinary medium can be used. For example, a medium including glucose, soybean meal, yeast extract, corn steep
liquor, potato starch and various salts may be used. The medium pH is 5 through 9, and preferably approximately 7.0,
and the medium temperature is 15°C through 50°C, and preferably approximately 28°C. The strain is cultured, for
example, aerobically for approximately 3 through 5 days with stirring or shaking. The SFase is mainly secreted
extracellularly.

II. Method for harvesting the enzyme:

[0017] In order to harvest and purify the SFase of the invention from the culture, one or a combination of known
purification methods can be used. For example, the culture solution may be centrifuged to obtain a supernatant, which
may be purified by an appropriate means, thereby obtaining the enzyme of the invention.
[0018] For example, the supernatant may be precipitated with ammonium sulfate. The precipitate may be dissolved
in a buffer solution, and the mixture may be subjected to separation by chromatography using S-Sepharose. Then, the
resultant may be subjected to affinity chromatography using S-Sepharose again, thereby giving the present enzyme.
The characteristics of the enzyme described below were measured by using this enzyme preparation.

III. Method for measuring the activity:

[0019] As a substrate, Z-Phe-Leu-Glu-pNA (wherein Z indicates carbobenzoxy, and pNA indicates a p-nitroaniline
residue) is used, and the substrate is dissolved in 50 mM Tris-HCl (pH 7.5, including 2 mM CaCl2 and 2% DMF) so as
to achieve a final concentration of 0.2 mM. An enzyme solution is added thereto and allowed to react at a temperature
of 37°C for 10 minutes. The absorbance of p-nitroaniline released through the enzymatic reaction into the reaction
solution is measured at 410 nm. The amount of the enzyme giving an absorbance of 1.0 is defined as 1 unit (U).
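
The unit definition above can be illustrated with a short calculation. This is only a minimal sketch: it assumes that the absorbance at 410 nm scales linearly with the amount of enzyme over the 10-minute reaction, and the function name `sfase_units` and the optional blank correction are hypothetical additions, not part of the measuring method described here.

```python
def sfase_units(a410_sample: float, a410_blank: float = 0.0) -> float:
    """Return the enzyme units (U) present in the assayed aliquot.

    One unit is the amount of enzyme giving an absorbance of 1.0 at 410 nm
    after 10 minutes at 37 degrees C under the assay conditions in the text.
    Linear scaling and the blank subtraction are assumptions of this sketch.
    """
    net = a410_sample - a410_blank
    if net < 0:
        raise ValueError("blank absorbance exceeds sample absorbance")
    return net / 1.0  # an absorbance of 1.0 corresponds to 1 U


if __name__ == "__main__":
    # Hypothetical readings, used only to show the call.
    print(sfase_units(0.45, a410_blank=0.05))
```

In practice a reading far above 1.0 would fall outside the range where the linear assumption is likely to hold, so the sample would normally be diluted and re-assayed.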

IV. Characteristics of the enzyme: 

[0020] The enzymatic and protein chemical characteristics of the SFase of the invention are as follows:

(1) Enzymatic action and substrate specificity: 

[0021] 

1. Synthetic substrates listed in Table 1 were prepared and respectively dissolved in 50 mM Tris-HCl (including 2
mM CaCl2, and dimethylformamide (DMF) or dimethyl sulfoxide (DMSO) in the ratio shown in Table 1; pH 7.5) so
as to achieve the respective concentrations listed in Table 1. The present enzyme was added thereto to cause a
reaction at a temperature of 25°C. By measuring the absorbance (410 nm) of p-nitroaniline released through the
enzymatic reaction into the reaction solution, the amount (A410 nm) of p-nitroaniline released from each substrate
per 1 minute was calculated. The results are listed in Table 1.

Table 1:
Substrate specificity:

Substrate              Concentration (final)      A410 nm
                       mM       Solvent (%)       /mg/min

Z-Phe-Leu-Glu-pNA      0.20     DMF    0.2        319.1
Boc-Ala-Ala-Glu-pNA    0.20     DMF    2.0        126.2
Boc-Ala-Ala-Asp-pNA    0.20     DMF    2.0         14.3
Bz-Tyr-pNA             0.02     DMF    2.0          0*)
Z-Trp-pNA              0.20     DMF    2.0          0*)
Bz-Arg-pNA             0.50     DMSO   1.0
Z-Gly-Ser-pNA          0.20     DMF    2.0
Suc-Ala-Ala-Ala-pNA    0.20                         0*)
Z-Ala-Ala-Leu-pNA      0.125    DMF    5.0          0*)
Leu-pNA                2.00                         0*)

Condition: 50 mM Tris-HCl, 2 mM CaCl2, pH 7.5, 25°C
Detection: A410 nm
DMF: N,N-dimethylformamide
DMSO: dimethyl sulfoxide
Z-: benzyloxycarbonyl-
Boc-: t-butoxycarbonyl-
Bz-: benzoyl-
Suc-: succinyl-
pNA: p-nitroanilide

*) No color detected at A410 nm through a reaction for 15 minutes at a concentration of the
enzyme of 4.2 µg/ml at a temperature of 23°C.

2. As a proteinous substrate, an oxidized B chain of insulin was selected, and the action of the present enzyme
against the substrate was examined. First, the oxidized B chain of insulin was dissolved in 50 mM Tris-HCl, pH
7.5, and the present enzyme was added thereto so as to achieve an enzyme/substrate ratio of 1/139 (W/W), and the
reaction was effected for a determined period of time. The obtained reaction mixture was subjected to HPLC using
a Vydac Protein C4 column, and eluted for 30 minutes by a linear concentration gradient using 0% through 50%
acetonitrile in 0.1% TFA. The thus obtained fragments were analyzed with regard to the amino acid sequence. As
a result, it was found that only the bond between Glu13 and Ala14 was cleaved.

[0022] From the results of the above-described items 1. and 2., it is apparent that the SFase of the invention is a
glutamic acid specific endopeptidase for cleaving a peptide bond including carboxyl in a glutamic acid residue in a
peptide.
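
The specificity stated above can be made concrete by listing the bonds on the carboxyl side of each Glu in a substrate. The sketch below is illustrative only. It assumes the bovine insulin B chain sequence, because the species of the insulin used is not stated here, and the function name `glu_c_sites` is hypothetical. The code merely enumerates candidate bonds; which of them is actually cleaved under given conditions is an experimental result (the text reports cleavage of the Glu13-Ala14 bond alone).

```python
# Bovine insulin B chain in one-letter code (assumption: the species is not
# stated in the text). In the oxidized chain the two Cys residues are present
# as cysteic acid, which does not change the positions of the Glu residues.
INSULIN_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKA"


def glu_c_sites(sequence: str) -> list[tuple[int, str]]:
    """List candidate cleavage sites on the carboxyl side of each Glu (E).

    Returns (position, bond) pairs using 1-based residue numbers.
    A Glu at the C-terminus has no following peptide bond and is skipped.
    """
    sites = []
    for i, residue in enumerate(sequence[:-1]):
        if residue == "E":
            nxt = sequence[i + 1]
            sites.append((i + 1, f"E{i + 1}-{nxt}{i + 2}"))
    return sites


if __name__ == "__main__":
    for position, bond in glu_c_sites(INSULIN_B):
        print(position, bond)
```

Under the stated assumption, the enumeration yields two candidate bonds, after Glu13 and after Glu21, so the observation of a single cleaved bond is narrower than the list of sequence candidates.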

(2) Optimal pH and stable pH range:

[0023] As a substrate, Z-Phe-Leu-Glu-pNA was used. The substrate was dissolved in 50 mM buffers including 10%
DMF (an acetate buffer (pH 3.2 through 6.1), a phosphate buffer (pH 5.5 through 8.1), a Tris-HCl buffer (pH 6.1 through
9.8) and a glycinate buffer (pH 9.1 through 11.1)). The SFase of the invention was added thereto, and the reaction was
effected at a temperature of 25°C for 10 minutes. The absorbance of p-nitroaniline released through the enzymatic
reaction into the reaction solutions was measured at 410 nm. When the reactions were effected in the reaction solutions
having various pH, it was found that the optimal pH of the reaction was approximately 8.2.

[0024] Next, under various pH conditions, the SFase of the invention was allowed to stand at a temperature of 37°C
for 1.5 hours, and the reactions were effected in accordance with the activity measuring method, thereby measuring
the residual activity. As a result, the stable pH range was found to be 8.0 through 9.0.

(3) Effect of an inhibitor: 

[0025] The effects of various inhibitors on the SFase of the invention are listed in Table 2. The SFase of the invention
is completely inhibited by diisopropyl fluorophosphate (DFP), but is not inhibited by ethylenediaminetetraacetic acid
(EDTA) and o-phenanthroline (OP), which are inhibitors of metal proteases. From this, the SFase of the invention was
found to be classified as a serine protease.

[0026] Further, the SFase of the invention is inhibited by neither Tosyl-Lys-chloromethylketone (TLCK) nor Tosyl-
Phe-chloromethylketone (TPCK), which are inhibitors of trypsin and chymotrypsin, but is completely inhibited by Boc-
Phe-Leu-Glu-chloromethylketone. From this, similarly to the result obtained with regard to the substrate specificity, the
SFase of the invention is found to be a glutamic acid specific endopeptidase.

Table 2:
Inhibition:

Inhibitor             Concentration     Residual
                      of inhibitor      activity (%)

Control               -                 100
Z-Phe-Leu-Glu-CK      0.5 mM              0
Boc-Ala-Ala-Ala-CK    0.5 mM            102
TPCK                  0.5 mM            101
TLCK                  0.5 mM            112
DFP                   2.0 mM              7
DFP                   0.5 mM             20
DFP                   0.25 mM            33
EDTA                  10.0 mM           126
EDTA                  1.0 mM            113
EDTA                  0.1 mM            110
OP                    10.0 mM           104
OP                    1.0 mM            108
OP                    0.1 mM            106

Buffer: 50 mM Tris-HCl, 2.0 mM CaCl2, pH 7.5
Preincubation: 37°C, 1 hour
Enzyme: 1.0 µg/ml
Substrate: 0.2 mM Z-Phe-Leu-Glu-pNA, 0.2% DMF

(4) Molecular weight: 

[0027] By SDS-PAGE (15% gel, 1.0 mm; molecular weight marker: RAINBOW™ Protein Molecular Weight Marker
manufactured by Amersham), the molecular weight of the present enzyme was measured, and as a result, it was
calculated to be 19,000. The molecular weight calculated based on the amino acid sequence determined from the
gene sequence of the present enzyme described below is 18,702, which coincides with the value obtained by
SDS-PAGE.

(5) Amino acid composition:

[0028] The SFase of the invention was hydrolyzed by using 4 M methanesulfonic acid [including 0.2%
3-(2-aminoethyl)indole] at a temperature of 110°C for determined periods of time (24, 48 and 72 hours). The respective hydrolyzed
products were analyzed with regard to amino acids by a Hitachi amino acid analyzer (Model 835). The results are listed
in Table 3. The amino acid composition determined based on the DNA sequence of the present enzyme (described
below) is listed as well. These results apparently coincide with each other.

Table 3:
Amino acid composition:

Amino acid    Measured value    Value from DNA sequence

Asp
Thr
Ser
Glu
Pro            2.9                3
Gly           33.5               31
Ala
Cys
Val
Met
Ile            6.1                7
Leu            5.9                5
Tyr           10.5               10
Phe            4.3                4
Lys            4.0                4
His            3.8                4
Arg            7.1                7
Trp            0.9
Residues                        187
Mol. wt.                      18702.4

(6) Partial sequence of the amino acid: 

1. N-terminal amino acid sequence:

[0029] The amino acid sequence around the N-terminal was analyzed by using a 477A Protein Sequencer manufactured
by Applied Biosystems. In the analysis, the present enzyme was previously inhibited by DFP. The amino acid
sequence from the N-terminal to the 31st residue (corresponding to the amino acids from Val in the 1 position to Ala in
the 31 position of SEQ ID No. 1) is shown in Table 4.

Table 4:
N-terminal sequence:

Step   Amino acid   pmol         Step   Amino acid   pmol

 1     Val          2515.1       17     Ala          1048.6
 2     Ala          1771.8       18     Phe           995.6
 3     Gly          2175.8       19     Asn           570.1
 4     Gly          2378.4       20     Val          1080.1
 5     Asp          1097.2       21     Thr           660.4
 6     Ala          2195.4       22     Lys           410.8
 7     Ile          2180.3       23     Asn           455.1
 8     Tyr          1341.4       24     Gly           979.0
 9     Gly          1014.5       25     Val          1054.4
10     Gly          2154.5       26     Arg           647.2
11     Gly          2227.4       27     Tyr           464.9
12     Ser           430.6       28     Phe           396.5
13     Arg           304.6       29     Leu           514.0
14     X            X            30     Thr           280.3
15     Ser           443.0       31     Ala           580.5
16     Ala          1002.9

1         11        21        31
VAGGDAIYGGGSR-SAAFNVTKNGVRYFLTA...
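
The one-letter line under Table 4 can be reproduced from the three-letter residue calls with a small conversion. The following is a minimal sketch; the names `THREE_TO_ONE`, `TABLE_4_CALLS` and `to_one_letter` are hypothetical, and the unidentified residue at step 14 (X) is rendered as a hyphen to follow the notation used in the one-letter line.

```python
# Three-letter to one-letter codes for the residues that occur in Table 4.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Gly": "G",
    "Ile": "I", "Leu": "L", "Lys": "K", "Phe": "F", "Ser": "S",
    "Thr": "T", "Tyr": "Y", "Val": "V",
}

# Residue calls for sequencing steps 1 through 31, in order.
TABLE_4_CALLS = [
    "Val", "Ala", "Gly", "Gly", "Asp", "Ala", "Ile", "Tyr", "Gly", "Gly",
    "Gly", "Ser", "Arg", "X", "Ser", "Ala", "Ala", "Phe", "Asn", "Val",
    "Thr", "Lys", "Asn", "Gly", "Val", "Arg", "Tyr", "Phe", "Leu", "Thr",
    "Ala",
]


def to_one_letter(calls: list[str], unknown: str = "-") -> str:
    """Join residue calls into a one-letter string; unidentified steps become `unknown`."""
    return "".join(THREE_TO_ONE.get(call, unknown) for call in calls)


if __name__ == "__main__":
    print(to_one_letter(TABLE_4_CALLS))
```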



2. C-terminal amino acid sequence:

[0030] The present enzyme previously inhibited by DFP was allowed to react with lysyl endopeptidase. The reaction
was effected in a 25 mM Tris-HCl buffer, pH 7.5, by using the inhibited SFase and the lysyl endopeptidase at a ratio
of 40:1 at a temperature of 24°C for 6 hours. Each of the obtained fragments was subjected to HPLC using a Vydac
protein C4 column (4.6 x 25 cm), and separated by linear concentration gradient using 20% through 80% of CH3CN
in 0.1% TFA under various conditions. As a result of the analysis of the fragments, a fragment having no basic amino
acid at a terminal of the peptide was found, and this fragment was identified as a peptide at the C-terminal. The amino
acid sequence at the C-terminal (corresponding to the amino acids from Glu in the 177 position to Tyr in the 187
position of SEQ ID No. 1) is listed in Table 5. Further, a sequence assumed to be present within the SFase was also
obtained (listed in Table 6; corresponding to the amino acids from Tyr in the 68 position to Arg in the 75 position of
SEQ ID No. 1).

Table 5:
C-terminal sequence:

Step   Amino acid   pmol         Step   Amino acid   pmol

 1     Glu           760.5        7     Gly           839.0
 2     Ala          1176.9        8     Val          1047.5
 3     Leu          1318.5        9     Asn           603.4
 4     Ser           353.0       10     Val           955.5
 5     Ala          1126.6       11     Tyr           951.4
 6     Tyr           934.0

Table 6:
Sequence of SFase (unknown region):

Step   Amino acid   pmol         Step   Amino acid   pmol

 1     Tyr          1843.5        6     Asn          1460.9
 2     Thr          2375.8        7     Val          2003.5
 3     Thr          2266.2        8     Asp          1896.5
 4     Thr          2713.1        9     Gly          1047.4
 5     Thr                       10     Arg           620.4

   1        10
...YTTTTNVDGR...



V. Determination of a DNA sequence for encoding the SFase: 

[0031] The terms used in the description of the present invention are defined as follows:

[0032] An "oligonucleotide" indicates a short single-stranded DNA, and can be chemically synthesized by a known
method. The oligonucleotide used in the present invention is chemically synthesized and then purified by gel
chromatography using a Sephadex G50 column and high-performance liquid chromatography (HPLC) using a reverse
phase silica gel column, if not mentioned otherwise.

[0033] "PCR" is an abbreviation of polymerase chain reaction, which is a method for enzymatically amplifying a
desired region in DNA (Saiki et al., Science, 239, 487-491 (1988)). First, DNA to be amplified is thermally denatured
into single strands, and the region at the 3' terminal of each of the obtained single-stranded template DNAs is annealed
with a complementary oligonucleotide primer (two types, one of which is a sense strand and the other an antisense
strand). Then, using the action of DNA polymerase, DNA chains are extended from the respective primers. By repeating
a series of these reactions, the objective DNA can be amplified a hundred thousand through a million times.
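
The amplification range quoted above can be related to the number of cycles with a short calculation. This is a minimal sketch under an idealized assumption, namely that each cycle multiplies the target by a constant factor of (1 + efficiency), with efficiency 1.0 meaning perfect doubling. Real reactions plateau and fall short of this, and the function names are hypothetical.

```python
import math


def ideal_amplification(cycles: int, efficiency: float = 1.0) -> float:
    """Fold amplification after `cycles` rounds, assuming a constant per-cycle efficiency."""
    return (1.0 + efficiency) ** cycles


def cycles_needed(fold: float, efficiency: float = 1.0) -> int:
    """Smallest whole number of cycles reaching at least `fold` amplification."""
    return math.ceil(math.log(fold) / math.log(1.0 + efficiency))


if __name__ == "__main__":
    for target in (1e5, 1e6):
        print(f"{target:.0e}-fold needs about {cycles_needed(target)} ideal cycles")
```

With perfect doubling, a hundred-thousand-fold to million-fold amplification corresponds to roughly 17 to 20 cycles; lower per-cycle efficiency raises that count.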

[0034] A "Southern analysis" indicates a method for determining whether or not an objective gene is included in a
DNA fragment obtained through cleavage by a restriction enzyme. For Southern analysis, DNA is first digested with a
restriction enzyme that recognizes a specific base sequence in the double-stranded DNA and cleaves it. The
obtained digest is subjected to 1% agarose gel electrophoresis, then denatured through an alkali treatment into
single strands, and the resultant is transferred onto a nylon filter. An oligonucleotide or a DNA fragment, which
is part of the objective gene, is prepared and labeled to be used as a probe. The probe is subjected to hybridization
with the single-stranded DNA on the nylon filter, thereby analyzing the DNA.

[0035] Next, a method for determining a DNA sequence encoding the SFase of the invention will be described
in the order of procedures. The DNA sequence was determined by analyzing a genome DNA of Streptomyces fradiae
ATCC 14544 strain through a method combining PCR, Southern analysis, a direct sequencing method and the like.

(1) Analysis of an internal sequence of a genome DNA by PCR: 

[0036] A sequence encoding the SFase can be obtained, for example, from a genome DNA. For this purpose, a
genome DNA of Streptomyces fradiae ATCC 14544 strain is first prepared from cultured cells of the strain by a known
method (J. Marmur et al., J. Mol. Biol. (1961) 3, 208-218). The obtained genome DNA is used as a template DNA for
PCR. An oligonucleotide primer used in PCR can be synthesized by a general method based on the amino acid
sequence around the N-terminal of the purified enzyme obtained in the above-mentioned item IV (6), and the amino acid
sequence of a peptide obtained by partially decomposing the enzyme. For example, an oligonucleotide encoding
the amino acid sequence in the 1 position through 8 position on the N-terminal side of the SFase, i.e., Val Ala Gly Gly
Asp Ala Ile Tyr (up to the second base of the triplet encoding Tyr at the C-terminal; 23-mer), is chemically synthesized
to be used as a sense primer SF1. Since Streptomyces fradiae is known to be GC rich, a primer (SEQ ID No. 2)
including only one kind of oligonucleotide is selected to be used without considering degeneracy.
[0037] Separately from the above, a peptide having the highest reliability is selected among the peptides that are
obtained by partially decomposing the purified enzyme with lysyl endopeptidase and sequenced, and an oligonucleotide
is synthesized based on the selected peptide. As is described in Example 2 below, since a sequence of Tyr Thr Thr
Thr Thr Asn Val Asp is obtained, a 24-mer complementary to the oligonucleotide encoding this amino acid sequence
is chemically synthesized to be used as an antisense primer SF2 (SEQ ID No. 3).

[0038] By conducting PCR using the template DNA, the sense primer SF1 and the antisense primer SF2, the objective
DNA chain in the genome DNA is extended and amplified. The obtained PCR product is analyzed through agarose
gel electrophoresis, thereby obtaining a DNA fragment having a length of approximately 220 bp. This DNA fragment
is introduced into an appropriate vector to conduct subcloning, and then the DNA sequence is determined by a chain
terminator method (Proc. Natl. Acad. Sci. USA Vol. 74, 5463-5467 (1977)). As a result, a DNA sequence of 225 bp is
determined, and it is found that the amino acid sequence Tyr Thr Thr Thr Thr Asn Val Asp, which is used as a base
for producing the antisense primer SF2, is an amino acid sequence positioned in the 68 position through 75 position.
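
The primer lengths and the 225 bp product size described in item (1) follow from simple codon arithmetic, sketched below. This is an illustration under stated assumptions, not the actual primers: SEQ ID No. 2 and SEQ ID No. 3 are not reproduced in this section, so the codon chosen for each residue (`gc_rich_codon`, which simply prefers the codon with the most G and C bases) is a hypothetical stand-in for the GC-rich choice mentioned in the text. All function and constant names are likewise hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_rich_codon(aa: str) -> str:
    """Pick one codon per residue, preferring high GC content (assumed rule)."""
    candidates = [c for c, a in CODON_TABLE.items() if a == aa]
    return max(candidates, key=lambda c: (sum(b in "GC" for b in c), c[2] in "GC", c))


def back_translate(peptide: str) -> str:
    """Non-degenerate back-translation of a one-letter peptide."""
    return "".join(gc_rich_codon(aa) for aa in peptide)


def reverse_complement(dna: str) -> str:
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons only; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Sense primer: residues 1-8 (Val Ala Gly Gly Asp Ala Ile Tyr), stopping at
# the second base of the Tyr codon, which gives 8 * 3 - 1 = 23 bases.
SENSE_PRIMER = back_translate("VAGGDAIY")[:-1]

# Antisense primer: complement of the 24 bases encoding Tyr Thr Thr Thr Thr Asn Val Asp.
ANTISENSE_PRIMER = reverse_complement(back_translate("YTTTTNVD"))

# Product spanning codon 1 through codon 75 of the mature peptide.
AMPLICON_BP = 75 * 3

if __name__ == "__main__":
    print(len(SENSE_PRIMER), len(ANTISENSE_PRIMER), AMPLICON_BP)
```

The arithmetic agrees with the figures in the text: a 23-mer sense primer, a 24-mer antisense primer, and a product of 225 bp when the antisense primer ends at the codon for the residue in the 75 position.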

(2) Southern analysis of the genome DNA: 

[0039] The genome DNA derived from Streptomyces fradiae ATCC 14544 strain prepared in the above-mentioned
item (1) is double-digested with restriction enzymes SalI and PstI, and the resultant is subjected to agarose gel
electrophoresis for separation. Then, the obtained DNA fragments are blotted on a nylon membrane filter for Southern
analysis. As a probe for hybridization, the PCR product obtained with SF1 and SF2 in item (1), previously labeled
with 32P-dCTP by a general method, is used. Positive DNA fragments hybridized with this probe appear as a
band of 2.0 kb.

(3) Determination of a base sequence of the genome DNA of the SFase:

[0040] The genome DNA of Streptomyces fradiae ATCC 14544 strain obtained in item (1) is cleaved with an
appropriate restriction enzyme, the resultant is introduced into an appropriate vector, and the DNA base sequence of the
SFase is determined by the chain terminator method mentioned in item (1).

[0041] For example, the genome DNA of Streptomyces fradiae ATCC 14544 strain obtained in item (1) is digested
with SalI and PstI. The DNA fragment of approximately 2 kb confirmed in item (2) is subcloned with M13mp10 previously
treated with the same restriction enzymes, and a plaque hybridization is conducted. At this point, as a probe, the probe
for hybridization used in the Southern analysis of item (2) can be used. Plasmid DNA is isolated from the plaque which
is found to be positive through plaque hybridization, and a DNA sequence of the inserted fragment of this plasmid is
determined. As a result of the DNA sequencing, the DNA sequence of 2064 bp shown in SEQ ID No. 1 is determined,
and the SFase gene is found to comprise a DNA sequence encoding a prepropeptide comprising amino acids of 170 residues
including methionine encoded by a translation initiation codon ATG and a mature peptide comprising amino acids of
187 residues. Eight bases upstream from the ATG in the -170 position, in the 5' non-translation region, a
sequence AAGGAG of a ribosome binding region is present. Further upstream, a -10 region and a -35 region both
including a promoter-like sequence are found. On the 3' side of the DNA sequence encoding methionine, a basic
amino acid characteristic of a signal peptide is present, followed by hydrophobic amino acids. Further, Ala-X-Ala
(wherein X indicates an optional amino acid), which is a signal peptide recognition sequence, is also present. Thus, it
is found that a typical signal peptide is constructed.
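
The residue counts above can be cross-checked against the base positions given for the preferred DNA sequence (A in the 435 position to C in the 1505 position of SEQ ID No. 1). The sketch below is a consistency check only. It assumes that the coding region is one uninterrupted reading frame, and the derived start position of the mature peptide is a calculation under that assumption rather than a value stated in the text; all names are hypothetical.

```python
PREPRO_RESIDUES = 170   # Met at -170 through the residue at -1
MATURE_RESIDUES = 187   # Val at 1 through Tyr at 187
CDS_START = 435         # A of the initiation codon ATG
CDS_END = 1505          # last base of the stated coding sequence


def span_length(start: int, end: int) -> int:
    """Number of bases in an inclusive, 1-based range."""
    return end - start + 1


def mature_start_base(cds_start: int = CDS_START, prepro: int = PREPRO_RESIDUES) -> int:
    """First base of the codon for mature residue 1, assuming a continuous frame."""
    return cds_start + 3 * prepro


if __name__ == "__main__":
    total_codons = PREPRO_RESIDUES + MATURE_RESIDUES
    print(total_codons, span_length(CDS_START, CDS_END), mature_start_base())
```

The stated span of 1071 bases equals exactly 357 codons (170 + 187), which suggests that the range from 435 to 1505 ends at the codon for Tyr in the 187 position and does not include a stop codon.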

[0042] I. Svendsen et al. (described above) reported an amino acid sequence of an enzyme of the same type
derived from Streptomyces griseus. When the amino acid sequence of this enzyme is compared with that of the SFase
of the invention, the homology is found to be 81.3%. Figure 1 shows the comparison between the amino acid sequences
of the mature peptides of these two enzymes. The letters used in Figure 1 indicate the ordinary amino acids, each with
one letter. I. Svendsen et al., however, did not perform gene cloning, and hence, the precursor, the promoter sequence
and the like are unknown.

[0043] The present invention provides a novel protease for specifically cleaving the C-terminal side of a glutamic acid
residue in the amino acid sequence of a polypeptide, and a DNA sequence encoding the protease. This protease can be
prepared by culturing Streptomyces fradiae or by chemical synthesis. Such a protease can be used for a variety of
purposes including analysis of a protein and cleavage of a peptide chain at a desired site in a fusion protein. The DNA
sequence encoding the protease can be used to produce and detect the protease.

(Examples) 

[0044] The present invention will now be described by way of examples.

[Example 1]

(Harvest of the SFase)

[0045] Streptomyces fradiae ATCC 14544 strain was cultured in a medium having pH 7.0 and including 3% potato
starch, 1% soybean meal, 0.5% corn steep liquor, 0.5% glycerol, 0.3% NaCl, 0.35% CaCO3 and 0.1% yeast extract
at a temperature of 28°C for 3 through 5 days (K. Morihara et al., Biochim. Biophys. Acta, 139 (1967) 382-387). The
obtained culture solution was centrifuged (at 8000 rpm for 30 minutes) to remove the cells and obtain a supernatant.
80% saturated ammonium sulfate was added to the obtained supernatant, thereby precipitating a protein. The thus
obtained precipitate was centrifuged (at 8000 rpm for 30 minutes), and the collected precipitate was dissolved in 10
mM Tris-HCl, pH 7.5, including 2 mM CaCl2. Then, the obtained mixed solution was dialyzed using the same buffer
solution as an outer solution. Next, the dialysis inner solution was lyophilized to obtain crude enzyme powder. Then,
104.4 g of the crude enzyme powder was suspended in 1.2 L of 5 mM Tris-HCl, pH 7.5, and the resultant was dialyzed
again by using 20 L of the same type of buffer solution as an outer solution for 20 hours. The obtained dialysis
inner solution was centrifuged (at 8000 rpm for 20 minutes), thereby obtaining a solution of 1.3 L.
[0046] To the obtained solution, 1.0 L of S-Sepharose (equilibrated with 5 mM Tris-HCl, pH 7.5) was added, and the
obtained mixture was gently stirred at a temperature of 4°C for 5 hours. Then, the supernatant was discarded, and the
S-Sepharose having adsorbed the enzyme was charged in a glass column (11.2 x 10 cm), washed with approximately
3 L of 5 mM Tris-HCl, pH 7.5, and eluted with 5 mM Tris-HCl, 0.3 M NaCl, pH 7.5.

[0047] Fractions having SFase activity were eluted from the column, collected (2.3 L), and dialyzed against 5 mM
Tris-HCl, pH 7.5, for two days and nights (during which an outer solution of 20 L was exchanged three times).
Approximately 2.5 L of the dialysis inner material was allowed to adsorb on approximately 500 ml of S-Sepharose (equilibrated
with 5 mM Tris-HCl, pH 7.5) charged in a column (5 x 25 cm). Then, the column was washed with approximately 2.5
L of the same type of buffer solution as that used in the equilibration, and eluted with 5 L of a buffer solution, including
0 through 0.1 M NaCl, by linear concentration gradient. Then, fractions having SFase activity were collected (eluted
at a concentration of approximately 0.05 M NaCl; approximately 1000 ml), and dialyzed overnight against 5 mM Tris-HCl,
pH 7.5. Next, the dialysis inner material was subjected to affinity chromatography. In the affinity chromatography,
approximately 70 ml of CH Sepharose 4B-Phe-Leu-D-Glu-OMe was used as a carrier, which was charged in a column
of 2.5 x 14 cm and equilibrated with 5 mM Tris-HCl, pH 7.5. The dialysis inner material was allowed to be adsorbed
upon the column. The column was washed with approximately 500 ml of the same type of buffer solution as that used
in the equilibration, and eluted with 1.0 L of the same type of buffer solution, including 0 through 0.3 M NaCl, by linear
concentration gradient.

[0048] The SFase activity of the collected fractions was measured in accordance with the aforementioned activity
measuring method. The SFase activity was found in fractions eluted around the NaCl concentration of 0.25 M. The
thus obtained fractions having SFase activity were collected (0.2 L), and dialyzed for 24 hours using 5 mM Tris-HCl,
pH 7.5, as an outer solution (20 L x 2). Approximately 320 ml of S-Sepharose charged in a column of 3.2 x 40 cm was
equilibrated with 5 mM Tris-HCl, pH 7.5, and the obtained dialysis inner material was allowed to be adsorbed upon the
column. Next, the column was washed with approximately 1.5 L of the same type of buffer solution as that used in the
equilibration, and eluted with 4 L of buffer solution, including 0 through 0.1 M NaCl, by linear concentration gradient.
[0049] The SFase activity of the collected fractions was measured in accordance with the aforementioned activity
measuring method. Further, the absorbance of each fraction at 280 nm was measured to be used as a criterion for a
protein concentration. Both results are shown in Figure 2. As is apparent from Figure 2, the SFase activity was found
in fractions eluted at a NaCl concentration of approximately 0.05 M (200 ml). The obtained fractions were dialyzed for
40 hours using 5 mM Tris-HCl, pH 7.5 (20 L each was exchanged three times) as an outer solution to be used as a
preserved solution. The thus obtained SFase exhibited a single band in SDS-PAGE. From 104.4 g of the crude enzyme
powder, approximately 3.0 mg of the enzyme was obtained (quantified using a Bio-Rad Protein assay kit).

[Example 2] 

(Sequencing of DNA for encoding the SFase) 

(1) Analysis of an internal sequence of a genome DNA by PCR:

[0050] A genome DNA was prepared from cultured cells of Streptomyces fradiae ATCC 14544 strain by the method
of J. Marmur (described above), and the obtained DNA was used as a template DNA for PCR. Oligonucleotide
primers to be used in PCR were produced based on partial amino acid sequences determined for the SFase produced
by Streptomyces fradiae ATCC 14544 strain. First, an oligonucleotide encoding the amino acid sequence from
position 1 through position 8 on the N-terminal side of the SFase, i.e., Val Ala Gly Gly Asp Ala Ile Tyr (up to the second
base of the triplet encoding Tyr at the C-terminal; 23-mer), was chemically synthesized to be used as a sense primer
SF1. Since Streptomyces fradiae is known to be GC rich, a primer (SEQ ID No. 2) consisting of only one kind of
oligonucleotide was selected and used without considering degeneracy.
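
The correspondence between the sense primer and the N-terminal residues can be checked by translating the 23-mer of SEQ ID No. 2. The following Python sketch uses the standard genetic code; the helper name `translate` and the table construction are illustrative choices and are not part of the original disclosure.

```python
from itertools import product

# Standard genetic code, built in TCAG order (illustrative helper).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "Ter",
}


def translate(dna):
    """Translate complete codons; return (residues, leftover bases)."""
    dna = dna.replace(" ", "").upper()
    full = len(dna) - len(dna) % 3
    residues = [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, full, 3)]
    return residues, dna[full:]


SF1 = "GTCGCCGGCG GCGACGCCAT CTA"  # SEQ ID No. 2 as listed
residues, leftover = translate(SF1)

# Both Tyr codons (TAT, TAC) begin with TA, so the two leftover bases are
# consistent with a primer that stops at the second base of the Tyr triplet.
tyr_compatible = all(
    CODON_TABLE[leftover + b] == "Y" for b in "TC"
)

print(" ".join(residues), "+", leftover)
print("leftover bases compatible with Tyr:", tyr_compatible)
```

Seven complete codons plus two bases give 7 x 3 + 2 = 23 bases, in agreement with the stated primer length.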

[0051] Next, the purified SFase obtained in Example 1 (after DEP treatment) was allowed to react with lysyl
endopeptidase (manufactured by Wako Pure Chemical Industries, Ltd.) at a temperature of 37°C for 5 hours. The resultant
enzyme digest was separated and purified by high-performance liquid chromatography using a Vydac Protein
C4, 300 Å column (4.6 x 25 cm). The amino acid sequences of the digested fragments thus obtained were determined
with a type 477 protein sequencer manufactured by Applied Biosystems. The amino acid sequence of one of the
fragments was Tyr Thr Thr Thr Thr Asn Val Asp. Then, a 24-mer complementary to the oligonucleotide encoding
this amino acid sequence was chemically synthesized to be used as an antisense primer SF2 (SEQ ID No. 3).
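
That the antisense primer is the complement of a coding sequence for this fragment can be verified by taking the reverse complement of SEQ ID No. 3 and translating it. The Python sketch below is illustrative only; the helper names are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, built in TCAG order (illustrative helper).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
NAMES = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val", "*": "Ter",
}


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (spaces ignored)."""
    cleaned = dna.replace(" ", "").upper()
    return cleaned.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna):
    """Translate complete codons of a sense-strand sequence."""
    return " ".join(NAMES[CODONS[dna[i:i + 3]]] for i in range(0, len(dna) - 2, 3))


SF2 = "GTCCACGTTG GTGGTGGTGG TGTA"  # SEQ ID No. 3 as listed
sense = reverse_complement(SF2)
peptide = translate(sense)

print(sense)
print(peptide)
```

The reverse complement uses ACC for every Thr, which fits the decision described for SF1 to pick a single GC-rich codon choice rather than a degenerate primer mixture.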
[0052] By using the template DNA, the sense primer SF1 and the antisense primer SF2, the DNA was amplified by
the PCR method [Saiki et al., Science 239, 487-491 (1989)]. When part of the obtained PCR product was analyzed by
1% agarose gel electrophoresis, a DNA fragment of approximately 220 bp was found. This DNA fragment was isolated,
blunted by treatment with Klenow fragment, and subcloned into M13mp10 previously treated with SmaI. Then,
the DNA sequence was determined by the chain terminator method (Sanger et al.; described above). As a result, a
DNA sequence of 225 bp was determined, and the amino acid sequence corresponding to the antisense primer SF2
was found to be positioned in the 68 position through the 75 position.
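
The 225 bp length is what one would expect for a product spanning residues 1 to 75 of the mature SFase. The Python sketch below works this out, together with the nucleotide coordinates in SEQ ID No. 1. It assumes the CDS start (435) and prepropeptide length (170 residues) given elsewhere in this specification; the function names are hypothetical.

```python
# Illustrative sketch: coordinates follow SEQ ID No. 1 (CDS starts at base 435,
# prepropeptide of 170 residues precedes mature residue 1). Names are hypothetical.
CDS_START = 435
PREPRO_RESIDUES = 170


def codon_span(mature_residue):
    """1-based (first, last) nucleotide positions of a mature residue's codon."""
    first = CDS_START + 3 * (PREPRO_RESIDUES + mature_residue - 1)
    return first, first + 2


def amplicon_length(first_residue, last_residue):
    """Length in bp of a product spanning the codons of both residues."""
    return codon_span(last_residue)[1] - codon_span(first_residue)[0] + 1


sf1_start = codon_span(1)[0]     # first base of the Val 1 codon
sf2_end = codon_span(75)[1]      # last base of the Asp 75 codon
product_bp = amplicon_length(1, 75)

print(f"SF1 begins at base {sf1_start}, SF2 ends at base {sf2_end}")
print(f"expected product: {product_bp} bp")
```

A product of this size is consistent with the band of approximately 220 bp estimated from the agarose gel.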

(2) Southern analysis of the genome DNA:

[0053] The genome DNA derived from Streptomyces fradiae ATCC 14544 strain prepared in item (1) was double-
digested with the restriction enzymes SalI and PstI, and the digest was subjected to 1% agarose gel electrophoresis.
Then, the obtained DNA fragments were blotted onto a nylon membrane filter, and Southern analysis was conducted.
As a probe for hybridization, the portion of the SF1-SF2 PCR product of item (1) corresponding to the amino acid
sequence from the 38 position to the 45 position of the SFase (SEQ ID No. 4; a probe SF3) was labeled with 32P-dCTP
by a general method. Positive DNA fragments hybridized with this probe appeared as a band of 2.0 kb.

(3) Determination of a base sequence of the genome DNA of the SFase:

[0054] The DNA fragments of approximately 2 kb obtained by digesting, with SalI and PstI, the genome DNA of
Streptomyces fradiae ATCC 14544 strain obtained in item (1) were treated with the same restriction enzymes
and then ligated to dephosphorylated M13mp10. The ligation was performed by using a commercially available kit
(manufactured by Takara Shuzo Co., Ltd.).

[0055] Next, plaque hybridization was conducted, using the probe SF3 of item (2) as the probe.
Plasmid DNA was isolated from the plaque which was found to be positive through plaque hybridization, and the DNA
sequence of the inserted fragment of this plasmid was determined by the aforementioned chain terminator method.
As a result of the DNA sequencing, a DNA sequence of 2064 bp was determined. The sequence is shown in SEQ
ID No. 1.
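
The ends of the determined sequence can be compared with the recognition sites of the two enzymes used for the digest. The Python sketch below is illustrative only: the terminal bases are copied from SEQ ID No. 1 as listed, the recognition sequences are the standard ones for SalI and PstI, and the variable and function names are hypothetical.

```python
# Illustrative sketch: recognition sequences of the enzymes used for the digest.
SITES = {"SalI": "GTCGAC", "PstI": "CTGCAG"}

# Terminal bases copied from SEQ ID No. 1 as listed (spaces removed).
FIVE_PRIME_END = "GTCGACCGGC"
THREE_PRIME_END = "CCCCTGCAG"
REPORTED_LENGTH_BP = 2064
SOUTHERN_BAND_BP = 2000  # the 2.0 kb band seen in the Southern analysis


def begins_with_site(seq, enzyme):
    """True if the sequence starts with the enzyme's recognition site."""
    return seq.upper().startswith(SITES[enzyme])


def ends_with_site(seq, enzyme):
    """True if the sequence ends with the enzyme's recognition site."""
    return seq.upper().endswith(SITES[enzyme])


sal_at_start = begins_with_site(FIVE_PRIME_END, "SalI")
pst_at_end = ends_with_site(THREE_PRIME_END, "PstI")
size_deviation = abs(REPORTED_LENGTH_BP - SOUTHERN_BAND_BP) / SOUTHERN_BAND_BP

print("SalI site at 5' end:", sal_at_start)
print("PstI site at 3' end:", pst_at_end)
print(f"sequence length vs. Southern band: {size_deviation:.1%} difference")
```

A SalI site at one end and a PstI site at the other is what one would expect for an insert cloned as a SalI-PstI fragment, and the 2064 bp length agrees with the band of about 2.0 kb.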

[0056] Through study of the DNA sequence of the SFase determined in the above-mentioned manner, it was found
to comprise a DNA sequence encoding a prepropeptide of 170 amino acid residues, including the methionine
encoded by the translation initiation codon ATG, and a mature peptide of 187 amino acid residues.
Eight bases upstream of the ATG at the -170 position, in the 5' non-translated region, a sequence AAGGAG
of a ribosome binding region was present. Further upstream, a -10 region and a -35 region, both including a promoter-
like sequence, were found. On the 3' side of the DNA sequence encoding the methionine, basic amino acids characteristic
of a signal peptide were present, followed by hydrophobic amino acids. Further, Ala-X-Ala (wherein X
indicates an arbitrary amino acid), which is a signal peptide recognition sequence, was also present. Thus, it was found
that a typical signal peptide was constructed.
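
The figures in this paragraph can be cross-checked against the feature coordinates of SEQ ID No. 1. The following Python sketch is an illustration under the assumption that the CDS location (435..1505) excludes the stop codon; the string of upstream bases is copied from the listing, and all names are hypothetical.

```python
# Illustrative sketch: coordinates and residue counts are taken from the text
# and the feature table of SEQ ID No. 1. Names are hypothetical.
CDS_START, CDS_END = 435, 1505
PREPRO_RESIDUES, MATURE_RESIDUES = 170, 187

# Bases from 421 up to and including the initiation codon, as listed.
UPSTREAM = "AAGGAGACCCCGGCATG"
RBS = "AAGGAG"

cds_length = CDS_END - CDS_START + 1
codons, remainder = divmod(cds_length, 3)

# Last base of the prepropeptide region and first base of the mature region.
prepro_end = CDS_START + 3 * PREPRO_RESIDUES - 1
mature_start = prepro_end + 1

# Number of bases between the end of the ribosome binding sequence and ATG.
spacer = UPSTREAM.index("ATG") - (UPSTREAM.index(RBS) + len(RBS))

print(f"CDS: {cds_length} bases = {codons} codons (remainder {remainder})")
print(f"prepropeptide: bases {CDS_START}..{prepro_end}")
print(f"mature peptide: bases {mature_start}..{CDS_END}")
print(f"spacer between {RBS} and ATG: {spacer} bases")
```

The 357 codons of the CDS split exactly into 170 prepropeptide residues and 187 mature residues, and the spacing between AAGGAG and ATG comes out as eight bases, as stated.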

(Sequence Listing) 

[0057]

SEQ ID No. 1
Length: 2064
Type: nucleic acid
Strandedness: double
Topology: linear
Molecule type: Genomic DNA
Original Source:
Organism: Streptomyces fradiae
Feature:
Name/key: -35 signal
Location: 359..364
Identification method: by experiment
Name/key: -10 signal
Location: 378..383
Identification method: by experiment
Name/key: CDS
Location: 435..1505
Identification method: by experiment

Name/key: sig_peptide
Location: 435..944
Identification method: by experiment

Sequence description:
GTCGACCGGC GCCGTGGTCG GCCAGCAGCC TTCGGCGGCG GCCGCGTCCG GCACCAACGA 60

CAAGGCCGGC GCCCCGCAGA ACCTGATGCG CTGGACGCTG ACCCGCGCCA TCAAGGAGAC 120 

GCTGGTCCCG CCGACCGGGT ACGGCTACCC GCACATGCTG ACGCCCGTGC AGCTGACGCC 180 

CGCCCCGGCC CGTACGCACG GGCCGGCGGG CCGACGCCCC GCCCCGCCGC ACCGCGGCCG 240 

GGCTGCGGCC CGCGAGGCCG CCCCCTGACG CCGCCCCCT? CCGGCCCCCA CGCCGGGAGG 300 

GGGCGGTCTC CATGTCCGGG TCGTCGCGCC GGAGGCCGGG CGCGGCGGAA ACGCGCCCTT 360

GCGGTCAGTG AAGCGTTCAT GTCAAGTTGC TCCCTCGGTA CGTCGCCACC CGTCACCGAC 420

AAGGAGACCC CGGC ATG AGA CGC ACC ACC CGC GCG CGC ACC GGT CTG TCC 470
                Met Arg Arg Thr Thr Arg Ala Arg Thr Gly Leu Ser
                -170                -165                -160

GCC ??? ??? CTC GCC GCC TCC CTC GGC CTC GGC GCC GCC CCG GCC GGA 518
Ala Leu Leu Leu Ala Ala Ser Leu Gly Leu Gly Ala Ala Pro Ala Gly
            -155                -150                -145

GCC GAC GCC CCC CAG AGG CCC GCC CCC ACC CCG GCC TCC GAC TCC GCC 566
Ala Asp Ala Pro Gln Arg Pro Ala Pro Thr Pro Ala Ser Asp Ser Ala
        -140                -135                -130

GCC CCC CTC CAC GCC CTC GAC GCC GCC GTC GAG CGG ACC CTC GGC GAC 614
Ala Ala Leu His Ala Leu Asp Ala Ala Val Glu Arg Thr Leu Gly Asp
    -125                -120                -115

GAC AGC GCC GGC ACC TAC GTG GAC GCC GGC ACC GGC GAA CTC GTC GTC 662
Asp Ser Ala Gly Thr Tyr Val Asp Ala Gly Thr Gly Glu Leu Val Val
-110                -105                -100                -95

ACC GTC ACC ACC GAG GCC GCC GCC GCC AAG GTC CGC GCG GCG GGC GCC 710
Thr Val Thr Thr Glu Ala Ala Ala Ala Lys Val Arg Ala Ala Gly Ala
                -90                 -85                 -80

ACG CCC CGG CGG GTC CAG CGC GGC GCC GCC GAG CTC GAC GCG GCC ATG 758
Thr Pro Arg Arg Val Gln Arg Gly Ala Ala Glu Leu Asp Ala Ala Met
            -75                 -70                 -65

GCC GCC CTG GAG GCA CGG GCC AAG ATC ??? GGC ACC TCG TGG GGG CTG 806
Ala Ala Leu Glu Ala Arg Ala Lys Ile Pro Gly Thr Ser Trp Gly Leu
        -60                 -55                 -50

GAC CCG CGC ACC AAC CGG ATC GCC GTG GAG GCC GAC TCC TCC GTC TCC 854
Asp Pro Arg Thr Asn Arg Ile Ala Val Glu Ala Asp Ser Ser Val Ser
    -45                 -40                 -35

GCC CGC GAC CTG GCC CGG CTC CGC AAG GTC GCC GCC TCC CTC GAC GGC 902
Ala Arg Asp Leu Ala Arg Leu Arg Lys Val Ala Ala Ser Leu Asp Gly
-30                 -25                 -20                 -15

GCC GTC AGC GTC ACC ??? GTC CCC GGC GTG TTC CAG CGC GAG GTG GCC 950
Ala Val Ser Val Thr Arg Val Pro Gly Val Phe Gln Arg Glu Val Ala
                -10                 -5                  1

GGC GGC GAC GCC ATC TAC GGC ??? GGC TCG CGC TGC TCG GCG GCC TTC 998
Gly Gly Asp Ala Ile Tyr Gly Gly Gly Ser Arg Cys Ser Ala Ala Phe
        5                   10                  15

[Nucleotides 999 to 1046 (residues 19 to 34) are illegible in the source; the residue labels that can still be read are, in order, Asn, Thr, Gly, Arg, Tyr, His and Cys.]

AC? AA? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? ??? TCC ATC 1094
Thr Asn Leu Ser Ser Thr Trp Ser Ser Thr ??? Gly Gly Thr Ser Ile
35                  40                  45                  50




GGC GTC CGC GAG GGC ACC AGC TTC CCG ACC AAC GAC TAC GGC ATC GTC 1142
Gly Val Arg Glu Gly Thr Ser Phe Pro Thr Asn Asp Tyr Gly Ile Val
                55                  60                  65

CGC TAC ACG ACC ACG ACC AAC GTG GAC GGC CGG GTC AAC CTG TAC AAC 1190
Arg Tyr Thr Thr Thr Thr Asn Val Asp Gly Arg Val Asn Leu Tyr Asn
            70                  75                  80

GGC GGC TAC CAG GAC ATC GCC TCC GCG GCC GAC GCC GTC GTG GGC CAG 1238
Gly Gly Tyr Gln Asp Ile Ala Ser Ala Ala Asp Ala Val Val Gly Gln
        85                  90                  95

GCC ATC AAG AAG AGC GGC TCC ACG ACC AAG GTC ACC AGC GGC ACC GTC 1286
Ala Ile Lys Lys Ser Gly Ser Thr Thr Lys Val Thr Ser Gly Thr Val
    100                 105                 110

AGC GCC GTC AAC GTC ACC GTC AAC TAC AGC GAC GGC CCC GTC TAC GGC 1334
Ser Ala Val Asn Val Thr Val Asn Tyr Ser Asp Gly Pro Val Tyr Gly
115                 120                 125                 130

ATG GTC CGC ACG ACC ??? TGC TCC GCC GGC GGC GAC AGC GGC GGC GCC 1382
Met Val Arg Thr Thr Ala Cys Ser Ala Gly Gly Asp Ser Gly Gly Ala
                135                 140                 145

CAC TTC GCC GGT TCC GTC GCC CTC GGC ATC CAC TCG GGC AGC TCC GGC 1430
His Phe Ala Gly Ser Val Ala Leu Gly Ile His Ser Gly Ser Ser Gly
            150                 155                 160

TGC ACG GGC ACC AAG GGC ??? GCC ATC CAC CAG CCG GTC ??? GAG GCC 1478
Cys Thr Gly Thr Asn Gly Ser Ala Ile His Gln Pro Val Arg Glu Ala
        165                 170                 175

CTG AGC GCC TAT GGC GTG AAC GTG TAC TGACCCGCCC CGCCCGGCCG 1525
Leu Ser Ala Tyr Gly Val Asn Val Tyr
    180                 185

GGCGGAGTGC CACAGTGGGC GGTATGAGTG ATCGTCTCCA TACCGGCCA? ACCTACGAGC 1585

TGGCACCGGC CGTGCTCGCC TCGATCCGCG CCCTCCTCCA GGACGCCTTC GAAGGCGACC 1645

TGAGCGAGGA GGACTGGGAG CACGGCCTCG GCGGCGTCCA CGCGTACGCC GAGGACCGCG 1705

GCGGGCGGAT CGTGGCGCAC GGCAGCGTCG TCATGCGCCG CGTGGTCCAC GCCGGGCGCG 1765

CCCACCGCG? CGGGTACGTG GAGGCCGTGG GCGTACGGGC GGACCGCCGC CGCCAGGGGC 1825

TGGGCACGCG GGTGATGGCG GCGCTGGAGG AGGTCCTCGA CGGGGCGTAC GACTTCGGCG 1885

CGCTCTCCGC CTCCGACGCG GGCGCCCTGC TCTAGCTCGC GCGGCTGGCA CCTGTGGAAC 1945

GGCCGCATCG AGGCGTACGG CCCGCACGGG GTGGTCCGGA TGGCGGACGA GGAGGACTGC 2005

ACCTTCCTCG GCCCTCGGCG GGCCGCCCGC TGCCCGCGCC GGAGGGCGGT CCCCTGCAG 2064



SEQ ID No. 2
Length: 23
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid, synthesized DNA
Sequence description:

GTCGCCGGCG GCGACGCCAT CTA

SEQ ID No. 3
Length: 24
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid, synthesized DNA
Sequence description:

GTCCACGTTG GTGGTGGTGG TGTA

SEQ ID No. 4
Length: 24
Type: nucleic acid
Strandedness: single
Topology: linear
Molecule type: other nucleic acid, synthesized DNA
Sequence description:

TCGTCCACCT GGTCGTCCAC CTCC



Claims

1. A polypeptide comprising a mature peptide and a signal peptide, wherein said mature peptide is characterised
by having protease activity for cleaving a peptide bond including a carboxyl group of a glutamic acid residue in a
peptide, and having an amino acid sequence from Val in position 1 to Tyr in position 187 of SEQ ID No. 1, and
wherein said signal peptide comprises an amino acid sequence from Met in the -170 position to Glu in the -1
position of SEQ ID No. 1.

2. A precursor protein comprising an amino acid sequence from Met in the -170 position to Tyr in the 187 position of
SEQ ID No. 1, wherein a mature form of said precursor protein has a protease activity for cleaving a peptide bond
including a carboxyl group of a glutamic acid residue in a peptide.

3. A polypeptide or protein according to claim 1 or 2 derived from Streptomyces fradiae.

4. A polypeptide or protein according to claim 1 or 2 derived from Streptomyces fradiae ATCC 14544 strain.

5. A DNA sequence for encoding any one of the polypeptide or protein of claims 1 to 5.

6. A DNA sequence according to claim 5 comprising a base sequence from A in the 435 position to C in the 1505
position of SEQ ID No. 1.



Patentansprüche (Claims)

1. A polypeptide comprising a mature peptide and a signal peptide, wherein the mature peptide is characterized
in that it has protease activity for cleaving a peptide bond including a carboxyl group of a
glutamic acid residue in a peptide and has an amino acid sequence from Val in position 1 to Tyr in
position 187 of SEQ ID No. 1, and wherein the signal peptide comprises an amino acid sequence from Met in position
-170 to Glu in position -1 of SEQ ID No. 1.

2. A precursor protein comprising an amino acid sequence from Met in position -170 to Tyr in position 187 of
SEQ ID No. 1, wherein a mature form of the precursor protein has protease activity for cleaving a peptide bond
including a carboxyl group of a glutamic acid residue in a peptide.

3. A polypeptide or protein according to claim 1 or 2, derived from Streptomyces fradiae.

4. A polypeptide or protein according to claim 1 or 2, derived from Streptomyces fradiae strain ATCC 14544.

5. A DNA sequence which codes for any polypeptide or protein according to claims 1 to 4.

6. A DNA sequence according to claim 5, which comprises the base sequence from A in position 435 to C in position 1505 of
SEQ ID No. 1.

Revendications (Claims)

1. A polypeptide comprising a mature peptide and a signal peptide, in which said mature peptide is characterized
in that it has protease activity for cleaving a peptide bond including a carboxyl group of a
glutamic acid residue in a peptide, and in that it has an amino acid sequence from Val in
position 1 to Tyr in position 187 of SEQ ID No. 1, and in which said signal peptide comprises an
amino acid sequence from Met in position -170 to Glu in position -1 of SEQ ID No. 1.

2. A precursor protein comprising an amino acid sequence from Met in position -170 to Tyr in
position 187 of SEQ ID No. 1, in which a mature form of said precursor protein has protease activity
for cleaving a peptide bond including a carboxyl group of a glutamic acid residue in
a peptide.

3. A polypeptide or protein according to claim 1 or 2, derived from Streptomyces fradiae.

4. A polypeptide or protein according to claim 1 or 2, derived from the Streptomyces fradiae strain ATCC
14544.

5. A DNA sequence for encoding any one of the peptide or protein of claims 1 to 5.

6. A DNA sequence according to claim 5, comprising a base sequence from A in position 435 to C
in position 1505 of SEQ ID No. 1.






EP 0 647 710 B1 



Fig. 1

Fig. 2